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Abstract 

We consider the problem of counting the number of answers to 
a first-order formula on a finite structure. We present and study 
an extension of first-order logic in which algorithms for this counting 
problem can be naturally and conveniently expressed, in senses that 
are made precise and that are motivated by the wish to understand 
tractable cases of the counting problem. 


1 Introduction 

1.1 Background: counting query answers and complexity 

The computational problem of evaluating a formula (of some logic) on a 
finite relational structure is of central interest in database theory and logic. 
In the context of database theory, this problem is often referred to as query 
evaluation, as it models the posing of a query to a database, in a well- 
acknowledged way: the formula is the query, and the structure represents 
the database. We will refer to the results of such an evaluation as answers ; 
logically, these are the satisfying assignments of the formula on the struc¬ 
ture. The particular case of this problem where the formula is a sentence is 
often referred to as model checking, and even in just the case of first-order 
sentences, can capture a variety of well-known decision problems from all 
throughout computer science ||FG06j . 

In this article, we are concerned with the counting version of this problem 
in first-order logic, namely, given a first-order formula and a finite structure, 
output the number of answers (see for example | PSll , GSi , DM . iCMa j for 
previous studies). In the sequel, we assume by default that (unless other¬ 
wise specified) all formulas and sentences are first-order and relational. This 
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problem of counting query answers generalizes model checking, which can 
be viewed as the particular case thereof where one is given a sentence and 
structure, and wants to decide if the number of answers is 1 or 0, corre¬ 
sponding to whether or not the empty assignment is satisfying. In addition 
to this counting problem’s basic and fundamental interest, it can be pointed 
out that all practical query languages supported by database management 
systems have a counting operator. Indeed, it has been argued that database 
queries with counting are at the basis of decision support systems that han¬ 
dle large data volume [G'Sj . 

As has been previously articulated, a typical situation in the database 
setting is the evaluation of a relatively short formula on a relatively large 
structure. Consequently, it has been argued that, in measuring the time 
complexity of query evaluation tasks, one could reasonably allow a slow 
(non-polynomial-time) preprocessing of the formula, so long as the desired 
evaluation can be performed in polynomial time following the preprocess¬ 
ing |PY99llFG06j . Relaxing polynomial-time computation to allow arbitrary 
preprocessing of a parameter of a problem instance yields, in essence, the 
notion of fixed-parameter tractability. This notion is at the core of and is 
the primary tractability notion in parameterized complexity theory, which 
provides a taxonomy for classifying problems where each instance has an 
associated parameter. Following this motivation, whenever the problem of 
counting query answers (or the model checking problem) is considered us¬ 
ing parameterized complexity, in this article, the formula is taken to be the 
parameter. 

The problem of counting query answers is well-known to be computa¬ 
tionally intractable. It is possible to restrict this problem by considering 
restricted classes of queries, and then trying to understand which classes of 
queries are computationally well-behaved in that they give rise to a tractable 
case of the general problem. Precisely, for a class $ of first-order formulas, 
define count($) to be the problem where an instance is a formula 0 e $ 
paired with a finite structure B, and the output is the number of answers 
of 0 on B. We hence have a family of problems, one problem count($) 
for each such formula class $, and one can inquire which of these prob¬ 
lems are tractable (and which are not). We will also have cause to consider 
the case of model checking, so, when $ is a class of first-order sentences, 
define MC($) to be the model checking problem where an instance is a 
sentence 0 e $ paired with a finite structure B, and the output is yes 
or no depending on whether or not B satisfies 0. The complexity of the 
problem family MC($) has been considered in numerous papers; see for 
example [Gro07l IAW091IHDT21ICM131 IChel41 ICMbllChe] . 
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1.2 Background: width 

Width is a syntactic measure of logical formulas. The width of a first- 
order formula 0 is defined as the maximum number of free variables over 
all subformulas of 0. In studies of the problem family MC($), width has 
emerged as a crucially relevant measure; we now turn to explain how. 

Say that a class $ of first-order sentences has bounded width if there 
exists a constant k ^ 1 such that each 0 e $ has width at most k. It is 
now well-known that bounded width sentence classes are computationally 
desirable for model checking, made precise as follows. 

Observation 1.1 f VarQtff Suppose that $ is a sentence class having bounded 
width. The problem MC($) is polynomial-time decidable, via the algorithm 
that (given a sentence and a structure) simply computes the set of satisfying 
assignments for each subformula, inductively. 

As suggested above, it is known that a problem is fixed-parameter tractable 
if, after performing preprocessing on the parameter, an instance can be re¬ 
solved in polynomial time [FOQ61 Theorem 1.37]. For a problem MC($), 
one can readily observe that if each sentence 0 6$ can be algorithmically 
translated to a sentence lying in a class having bounded width, then MC($) 
is fixed-parameter tractable as a consequence of the previous observation. 
This can be formalized as follows. 

Observation 1.2 \Chelf | / Suppose that $ is a sentence class. The following 
condition, which we will call the classical condition, is sufficient for the 
problem MC($) to be fixed-parameter tractable: there exists a sentence class 
$' having bounded width and an algorithm f that computes, for each 0 6$, 
a logically equivalent sentence /(0) that is in $'. 

Research on the problem family MC($) has succeeded in obtaining classi¬ 
fications on classes $ of bounded arity where the quantifiers and connectives 
are restricted (see for examples [OroD7l ICM13i IChel4| ). An example rele¬ 
vant to the present article is work of Chen [Chel4j . who studied existential 
positive logic (by which mean the positive fragment of first-order logic con¬ 
sisting of formulas built from atoms, a, v, and 3), and, making crucial use 
of a hardness result by Grohe [Gro07j . observed that when $ is a class of 
existential positive sentences having bounded arity, the problem MC($) is 
fixed-parameter tractable if and only if the classical condition applies to $. 
(Here and elsewhere in our discussion, we assume the standard complexity- 
theoretic hypothesis that FPT ^ W[l].) That is, the sufficient condition for 
fixed-parameter tractability identified by Observation 11.21 is the exclusive 
explanation for fixed-parameter tractability, in the described setting of exis¬ 
tential positive logic. Let us remark that the query preprocessing algorithm 
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/ here (in Observation E2D is related to and akin to the database notion of 
a query optimizer that computes a query execution plan. 

Existential positive logic is a natural fragment of first-order logic |Rosf)8j 
and is studied heavily in database theory. Existential positive formulas 
include and are semantically equivalent to so-called unions of conjunctive 
queries , also known as select-project-join-union queries , which have been 
argued to be the most common database queries j.\IIV9-l . Recently, the 
present authors generalized the mentioned dichotomy on existential positive 
sentence classes, by presenting a classification theorem [ CM16] describing 
the fixed-parameter tractable problems of the form count(<f>), where is a 
bounded arity class of existential positive formulas. The classification the¬ 
orem is in fact a trichotomy theorem, which demonstrates that the studied 
problems count($) can exhibit three types of complexity behavior. 

1.3 Contributions 

As a means of introducing our contributions, we here wish to highlight a 
conceptual point: the applicability of the above classical condition indicates 
that for the model checking problem, first-order logic itself can be used as 
a model of computation in which desirable, efficient algorithms can be ex¬ 
pressed. This condition posits the existence of an algorithm that translates 
first-order sentences to a polynomial-time evaluable format; and, the par¬ 
ticular format used therein is that of a first-order sentence! Let us highlight 
that here, logic can be viewed as playing two complementary roles: on the 
one hand, the computational problems of interest are phrased directly in 
terms of logic; on the other hand, appropriate algorithmic solutions to this 
problem are themselves describable by logical sentences. 

Inspired by this perspective of logic as a useful model of computation , 
the present work was motivated by the desire to develop a logic that could 
serve as a useful model of computation for the problem of counting query 
answers—analogously to how first-order logic itself serves as a useful model 
of computation for the model checking problem. 

Let us point out some desiderata that such a logic ought to fulfill. First, 
recall that in the problem of counting query answers, an instance is a first- 
order formula paired with a structure, and the output is the number of 
answers. The hope, then, would be to be able to translate a first-order 
formula to a sentence -0 in the logic such that evaluating if on a structure 
returns the number of answers to i ft on the structure; hence, in the logic, 
the evaluation of a sentence on a structure ought to return a numerical 
quantity , instead of a propositional value as in usual first-order logic. Second, 
the logic must accommodate the fact that, while the classical condition 
explains all tractable cases of model checking in existential positive logic 
(in the sense made precise), there are classes $ of queries on which the 
classical condition does not hold, but for which the problem count(<3?) is 
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fixed-parameter tractable. For instance, consider the class 0 = {9 n \ n ^ 1} 
where 9 n (x i, ... ,x n ) = U\(x\) a • • • a U n (x n ). The number of answers for 
6 n on any structure B is straightforwardly verified to be Y\a =i II j given 
9 n and B, this quantity can indeed be computed in polynomial time, but it 
can be verified that the classical condition does not apply to 0. 

(j-logic. In this article, we introduce and study a logic, which we call ft- 
logic and which possesses the sought-after characteristics just described. In 
fj-logic, the evaluation of a Jj -sentence (a type of formula in jj-logic) on a 
structure returns an integer value. 

From the discussed viewpoint of bounded width as an explanation for 
the tractability of model checking, the relationship of the counting query 
answers problem to $-logic is strongly analogous to the relationship of model 
checking to usual first-order logic. The following parallel of Observation ll.il 
holds. (Note that the width of a formula in jj-logic will be defined in a natural 
way.) 

Observation 1.3 Suppose that 'k is a (j -sentence class having bounded width. 
The problem count(’k) is polynomial-time computable. 

As expressed, one purpose of (j-logic is to allow for the translation of a 
first-order formula cp to a jj-sentence ip such that ip represents <p in that, for 
any structure B, the number of answers to cp on B is equal to the quan¬ 
tity that results from evaluating ^ on B. The following is an immediate 
consequence of the previous observation, and a parallel of Observation 11.21 

Observation 1.4 Suppose that $ is a first-order formula class. The follow¬ 
ing condition, which we will call the counting condition, is sufficient for the 
problem count( < I > ) to be fixed-parameter tractable: there exists a fl -sentence 
class T having bounded width and an algorithm f that computes, for each 
(p e <3?, a jj- sentence f{cp) that is in T and that represents cp. 

Of course, Observation PI is only of interest if the counting condition 
applies to interesting first-order formula classes <h, that is, only if this condi¬ 
tion allows one to explain the fixed-parameter tractability of such problems 
count(<h). We in fact show that for any problem count(<b) that is tractable 
according to the mentioned classification theorem on existential positive 
logic, the counting condition applies to T. That is, the counting condition is 
the exclusive explanation for the tractability of count(T) in this existential 
positive setting, providing an analog to the result that the classical condition 
is the exclusive explanation for the tractability of MC(^) in the existential 
positive setting. On a conceptual level, we view our result as strong evi¬ 
dence that, for the problem of counting query answers, jj-logic is a useful, 
expressive model of computation in which relevant, efficient algorithms can 
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be presented. This result is obtained as an immediate consequence of two 
theorems: 

• We show that when such a problem count(<h) is tractable, then there 
exists a bounded width class T of (-sentences such that each cf> e <f> 
has a representation in T (Theorem 14.31) . 

• We prove that there is a minimization algorithm that, given an exis¬ 
tential positive formula, computes a representation of minimum width 
(Theorem [TT]) . 

The latter theorem can be read as demonstrating that (i-logic is well- 
characterized and well-understood as a model of computation: conceiving 
of a ((-sentence representation of an existential positive formula as a com¬ 
putational procedure for counting query answers, this theorem provides a 
minimization algorithm that always outputs an optimal procedure for a 
given existential positive formula, where optimality here is measured using 
width. 

In short, our presentation and study of ((-logic forwards the discussed use 
of logic as a means for expressing computationally desirable procedures; in 
particular, ((-logic allows for the direct expression of procedures for counting 
query answers. 

Counting homomorphisms. The problem of counting the number of 
homomorphisms from a given source structure A to a given target structure 
B arises and has been studied in numerous contexts [GT , Ihil 13 . This prob¬ 
lem can be viewed as the special case of counting query answers where the 
formula is a quantifier-free conjunction of atoms; there is indeed a correspon¬ 
dence that allows one to pass from a source structure A to such a formula 4>\ 
which originates from the classical work [CM77| (and which is explained in 
Section [2]) . This problem is now well-known to be polynomial-time tractable 
under a constant treewidth bound on the permitted source structures; in¬ 
deed, the corresponding algorithm, which performs dynamic programming 
over a tree decomposition of A, has received a textbook treatment [FKIf)L 
Section 5.3]. 

We discuss how, from a tree decomposition for a structure A, one can 
polynomial-time compute a ((-sentence ip that has width at most the width 
of the given decomposition (plus one), and that represents A in the sense 
that evaluating ip on an arbitrary structure B always yields the number of 
homomorphisms from A to B (Example 13.41) . Combining this result with 
Observation 11.31 we obtain that the algorithm of this well-known polynomial¬ 
time tractability result can be expressed in ((-logic. Indeed, we believe that 
the resulting (-sentences accurately, faithfully, and cleanly describe the ex¬ 
ecution of this algorithm. 


6 








A dual perspective on a classical Lovasz theorem. Fix a relational 
signature r; in the following discussion, all structures are finite and on r. 
Let B be a structure, let str[r] denote the class of finite structures on r, 
and let L{ B) be the vector from Q str H that maps a structure A e str[T] to 
the number of homomorphisms from A to B. A classical theorem of Lovasz 
states that, for any two structures B, B', it holds that L( B) = L(B') if and 
only if B and B' are isomorphic | Lov67 ]. 

When one is concerned with homomorphisms from one structure A to 
another structure B, sometimes, the structure A is referred to as the left- 
hand structure and the structure B is referred to as the right-hand structure. 
The vectors studied by Lovasz indicate, for a structure B, the number of 
homomorphisms coming from each possible left-hand side structure. One 
can naturally formulate a dual vector, as follows. For any structure A, 
define R( A) to be the vector from Q str M that maps a structure B e str[r] 
to the number of homomorphisms from A to B. That is, the vector R( A) 
indicates, for a structure A, the number of homomorphisms to each possible 
right-hand side structure. 

Our previous work [ CM16] implied a dual of Lovasz’s theorem, namely, 
that for any two structures A, A', it holds that R( A) = R(A') if and 
only if A and A' are isomorphic. In the present work, we prove and use 
a natural generalization of this fact, namely, that for any finite sequence 
Ai,...,Afc of pairwise non-isomorphic structures, the vectors i?(Aj) are 
linearly independent (Theorem 15.111 . This linear independence theorem is 
used as a key tool to establish the correctness of our minimization algorithm. 
We believe that it should play an important role in future studies of counting 
query answers, and that the techniques and concepts that its proof requires 
may be of independent and future interest. Indeed, to prove this theorem, 
we extend notions and techniques from the work of Lovasz [ Lov67| : for 
example, we introduce and crucially use a notion of multivariate polynomial 
associated to a primitive positive formula. 

1.4 Discussion 

Logics with counting mechanisms have been considered in finite model the¬ 
ory and descriptive complexity; one well-known example is the counting logic 
studied by Immerman and Lander 11.90 . A typical motivation in these ar¬ 
eas for studying such logics is the desire to extend first-order logic so as to 
allow for the capture of properties not expressible in first-order logic. This 
motivation contrasts somewhat with ours here; our objective is to introduce 
logics that allow for the relatively direct expression of useful algorithms for 
the problem of counting query answers. We believe that it could be of inter¬ 
est to try to understand the relationship (if any) between existing counting 
logics and the logics studied in the present work. 

We wish to emphasize that, as regards our present motivations, our 
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logiqj trades off expressivity and computability properties in an extremely 
desirable fashion. On the one hand, the algorithms for the tractable cases of 
count(<h) (where <f> is existential positive) can be expressed in our logic, as 
described above; on the other hand, our minimization algorithm described 
above evidences that the measure of width can be computably minimized in 
our logic, and is thus in a certain sense well-characterized. Let us remark 
here that previous work established that there is no algorithm for minimizing 
width in positive first-order logic [BC141 Section 5]. 

We believe that our introduction of ({-logic may open up further research 
directions. One particular question for future research that we may pose is 
whether or not there are Ehrenfeucht-Fra'isse style games for proving inex- 
pressibility in bounded width fragments of our logic. 

2 Preliminaries 

Logic. We assume basic familiarity with the syntax and semantics of first- 
order logic. In this article, we focus on relational first-order logic where 
equality is not built-in to the logic. Hence, each vocabulary/signature un¬ 
der discussion consists only of relation symbols. We assume structures un¬ 
der discussion to be finite (that is, have finite universe); nonetheless, we 
sometimes describe structures as finite for emphasis. We use the letters 
A, B, ... to denote structures, and the corresponding letters A, B , ... 
to denote their respective universes. When A, B are structures over the 
same signature r, a homomorphism from A to B is a mapping h : A —> B 
such that, for each Ret and each tuple (ai,... ,a k ) e i? A , it holds that 
( h(ai ),.. .,h(a k )) e R B ■ 

We use the term fo-formula to refer to a first-order formula. An ep- 
formula (short for existential positive formula ) is a fo-formula built from 
atoms (by which we refer to predicate applications of the form R(v i,..., v k ), 
where R is a relation symbol and the Vi are variables), conjunction (a), 
disjunction (v), and existential quantification (3). A pp-formula (short 
for primitive positive formula ) is an ep-formula where disjunction does not 
occur. An fo-formula is prenex if it has the form Q\V\ ... Q n v n 9 where 9 
is quantifier-free, that is, if all quantifiers occur in the front of the formula. 
The set of free variables of a formula cf is denoted by free(^) and is defined as 
usual; a formula is a sentence if free(</>) = 0. We define an ep-sentence to 
be an ep-formula that is a sentence, and define fo-sentence and pp-sentence 
similarly. 

We now present some definitions and conventions that are not totally 
standard. A primary concern in this article is in counting satisfying assign¬ 
ments of fo-formulas on a finite structure. The count is sensitive to the set 
of variables over which assignments are considered; and, we will sometimes 
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want to count relative to a set of variables that is strictly larger than the 
set of free variables. Hence, we will often associate with each fo-formula 0 a 
set V of variables called the liberal variables, denoted by lib(0), for which it 
is required that lib(0) 2 free(0). We generally assume that the variables in 
lib(0)\free(0) are not used in 0. To indicate that V is the set of liberal vari¬ 
ables of 0, we often use the notation 0(H); we also use 0(tq,..., v n ), where 
the Vi are a listing of the liberal variables. Relative to a formula 0(E), 
when B is a structure, we will use 0(B) to denote the set of assignments 
/ : V —> B such that B, / |= 0. We call an fo-formula 0 free if free(0) A 0, 
and liberal if lib(0) is defined and lib(0) A 0. 

Example 2.1 Consider the formula <f>(x,y,z) = E{x,y ) v F(y,z). Define 
if(x,y,z) = E(x,y) and 0'(x,y, z) = F(y,z). The notation is intended 
to indicate that lib(0) = lib (VO = lib(0') = {x,y,z}. In the context of 
studying 0, it is natural to define lib (VO and lib (VO to be {x,y,z}; under 
this definition, it holds that 0(B) = 0(B) u 0 / (B), but this would not hold 
in the case that (say) lib(0) was defined as {x, y} (which set is equal to 
free(0)). □ 

pp-formulas. It is well-known [CM77J that there is a correspondence be¬ 
tween prenex pp-formulas and relational structures. In particular, each 
prenex pp-formula 0(5) (on signature r) with lib(0) = S may be viewed 
as a pair (A, S ) consisting of a structure A (on r) and a set S'; the universe 
A of A is the union of S with the variables appearing in 0, and the following 
condition defines the relations of A: for each R e r, a tuple (ai,..., a*,) e A k 
is in R a if and only if R(a\, ..., a*,) appears in 0. In the other direction, such 
a pair (A, S) can be viewed as a prenex pp-formula 0(S) where all variables 
in A\S are quantified and the atoms of 0 are defined according to the above 
condition. A basic known fact [CM77] that we will use is that when 0(S) is 
a pp-formula corresponding to the pair (A,S), B is an arbitrary structure, 
and / : S —> B is an arbitrary map, it holds that B, / (= 0(S) if and only 
if there is an extension f of / that is a homomorphism from A to B. We 
will freely interchange between the structure view and the usual notion of a 
prenex pp-formula. For a prenex pp-formula specified as a pair (A, S), we 
typically assume that 5 c A. 

Example 2.2 Consider the pp-formula 0(rt, v, w, x) = 3y(E(u, v) aF(w, y)). 
To convert 0 to a structure A, we take the universe A of A to be the union 
of lib(0) with the variables appearing in 0, so A = {u, v, w, x, y}. The rela¬ 
tions of A are as defined above, so E A = {(u,v)} and F A = {(w,y)}. The 
resulting pair representation of 0 is (A, {u, v, w, 2 }). □ 

Graphs. Throughout the paper, all graphs under discussion should be 
assumed to be undirected by default. 
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To every prenex pp-formula (A, S ) we assign a graph whose vertex set 
is A u S and where two vertices are connected by an edge if they appear 
together in a tuple of a relation of A. A prenex pp-formula (A, S) is called 
connected if its graph is connected. A prenex pp-formula (A', S') is a compo¬ 
nent of a prenex pp-formula (A, S') over the same signature r if there exists 
a set C that forms a connected component of the graph of (A, S'), where 
S' = S n C, and for each relation Ret, a tuple (ai,..., a*,) is in R A if and 
only if (ai,..., a*,) e R A n C k . 

Note that when this holds, the graph of (A', S') is the connected compo¬ 
nent of the graph of (A, S) on vertices C. We will use the fact that, if 4>(V) 
is a prenex pp-formula and cf> i(Vf),».. ,4>kiYk) is a list of its components, 
then for any finite structure B, it holds that |0(B)| = D[i=i |0j(B)|. 

Example 2.3 Consider the formula © from Example 12.21 The connected 
components of the graph of 0 can be readily verified to be {u, v}, {w, y}, and 
{x}. Hence, the pp-formula 4> has three components, which can be readily 
verified to be 4>i(u,v) = E(u,v), (j> 2 (w,y) = 3yF(w,y), and 4>3(x) = T; here, 

T denotes the empty conjunction. □ 

Treewidth. We give some basic facts about tree decompositions and treewidth; 
see for example (FG06 ] for more details. 

A tree decomposition of a graph G = (V,E) is a pair (T,(B t )teV(T)) 
where T is a tree and ( B t ) teV ( t) is a family of subsets of V such that (1) 
for every v e V, the set {t e V(T) \ v e Bt} is non-empty and connected in 
T, and (2) for every edge uv e E, there is a t e V(T) such that u,v e B t . 

The width of a tree decomposition (T, {B t ) t eV(T)) is defined as max{|Bt| : 
t e V(T)} — 1. The treewidth tw(G') of G is the minimum width over all 
the tree decompositions of G. Computing tree decompositions of minimal 
width is fixed-parameter tractable parameterized by the treewidth |Bod96j . 

A tree decomposition is called nice, if every t e V ( T) is of one of the 
following types: 

• (leaf) t has no children and \Bt\ = 1. 

• (introduce) t has one child t' and Bt = Bf u {v} for a vertex v 6 V\Bf. 

• (forget) t has one child t' and Bt = B t >\{v} for a vertex v 6 Bf. 

• (join) t has two children t\,t 2 with Bt = B ^ = Bt 2 . 

It is well-known that a width k tree decomposition of G can be converted 
to a width k nice tree decomposition, in polynomial time. 
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3 fj-logic 

In this section, we present the syntax and semantics of jj-logic, as well as 
some associated terminology. Syntactically, jj-logic consists of jj -formulas; 
each jj-formula <p> has an associated set of free variables, denoted by free(^), 
as well as an associated set of closed variables, denoted by closed (c/>). Let 
B be a structure, let ^ be a jj-formula over the signature of B, and let 
h : free(</>) —* B be an assignment. Semantically, evaluating p with respect 
to B and h returns an integer value, as opposed to a propositional value (as 
for a fo-formula); to present the symantics of jj-logic, we will notationally 
use [B,0] to denote the mapping taking an assignment h : free(</>) —» B to 
the integer value provided to the triple consisting of 0, B, and h. 

Before turning to the formal definition of jj-logic, we provide an example 
that previews some aspects of this logic. 

Example 3.1 We consider the pp-formula cj>(x,y,z) = E(x,y) a F(y,z). 
The first type of jj-formula is a casting of a fo-formula; define ip to be 
C(E(x,y), {x,y}), which is a jj-formula with free variables {x, y}. Let B 
be a structure, and let h : {x,y} —> B be an assignment; [B ,ip\(h) is equal 
to 1 or 0 depending on whether or not B, h j= E(x,y) (respectively). Once 
one has a jj-formula, it is possible to define a further jj-formula by projecting 
free variables. For example, P{y}ip is a jj-formula with free variables {x}. 
When g : {x} —► B is an assignment, the value [B, P{y}i/>](g) is the number 
of extensions h : {x, y} —> B of g such that B, h |= E(x, y). In an analogous 
fashion, one may define ip 1 to be the jj-formula C(F(x, z ), {x, z}); then, when 
g : {x} —* B is an assignment, the value [B, P{z}ip'\(g) is the number of 
extensions h! : {x, z} —> B of g such that B, h! (= F(x, z). 

Now, let g : {x} —> B be an assignment. Observe that the number of 
extensions h + : {x,y,z} —► B of g such that B ,h + \= E(x,y) a F(x,z ) is 
equal to the product of [B, P{y}ip\(g) and [B, P{z}ip'](g). A product con¬ 
nective x is provided by jj-logic, and said product is equal to [B, {P{y}ip) x 
(P{z}ip')\(g). Finally, 0 = P{x}((P{y}ip ) x (P{z}ip')) is a Jj-formula with no 
free variables; letting e denote the empty assignment, [B, 0](e) will be equal 
to the sum, over all assignments g : {x} —> B , of [B, ( P{y}ip ) x (P{z}'ip')](g), 
which is equal to the number of assignments / : {x, y, z} —> B such that 
B,/ |= p. Phrased in terminology introduced below, the jj-formula 6 repre¬ 
sents the fo-formula (p(x. y.z). □ 

3.1 Syntax 

We define ((-formulas inductively, as follows. 

• L ) is a jj-formula if (j) is a fo-formula 

and L 2 free(^). 

Define free(CV/») = L and closed(C0) = 0. 
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• PVcf is a ((-formula if <j> is a jj-formula and V is a set of variables with 

V n closed(0) = 0. 

Define fr ee(PVcf) = fr ee((f)\V 
and closed (PV <0 = V u closed(</>). 

• EV4> is a 0-formula if 0 is a jj-formula and V is a set of variables with 

V n (free(0) u closed(V>)) = 0. 

Define fr ee(EV<j)) =fu free(0) 
and closed {EV(j)) = closed ((/>). 

• (j) x (j)' is a ((-formula if (f and (ft are ((-formulas with free(0) = fr ee(<ft) 
and closed ((j>) n closed ((ft) = 0. 

Define free(0 x (j)') = free(V/>) 

and closed(V> x (ft) = closed(V>) u closed(V>'). 

• (f + (ft is a ((-formula if (f and (ft are ((-formulas with f'ree(r/>) = free((ft). 

Define fr ee((f + (j)') = free(V>) 

and closed (cf + <ft) = closed(<0 u closed ((ft). 

• n is a ((-formula if n e Z. 

Define free(n) = 0 
and closed (ra) = 0. 

A formula C(<f,L) can be thought of as the casting of a fo-formula q i> 
into a ((-formula; the P quantifier can be thought of as projecting or closing 
variables; and the E quantifier can be thought of as expanding the set of free 
variables. The connectives x and + perform the usual arithmetic operations. 

Let ^ be a ((-formula. A ft-subformula 6 of -0 is defined as a ((-formula 
such that, when if and 6 are viewed as trees, 6 appears as a subtree of if. A 
fo-subformula of if is a subformula of any fo-formula (f such that if contains 
a ((-subformula of the form C(cf,L). A subformula of if is a ((-subformula or 
fo-subformula of if. We define width(^) to be the maximum of |free(#)| over 
all subformulas 0 of if, and ((-width(g/ 7 ) to be the maximum of |free(0)| over 
all ((-sub formulas 6 of if. We say that if is a (j- sentence if free (VO = 0. 

We define a ^PP-formula to be a ((-formula where, in each O-subformula 
of the form C(cf,L), (f is a pp-formula; the notion of ^EP-formula is defined 
analogously. We define a OPP-senfence to be a OPP-formula that is a jj- 
sentence, and we define a OEP-senience similarly. 

3.2 Semantics 

We define the semantics of our logic as follows. 

• When C((f,L ) is a ((-formula, 

[B, 4>\{h) = 1 if B, h |= <f\ [B, <f](h) = 0 otherwise. 
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• When PV<t> is a {(-formula, 

[B, PV(f>](h) = Yj h ' [B, 4>\{h'), where the sum is over all extensions 

h! : free(0) u V —*• B of h. 

• When EV(j) is a {{-formula, 

[B ,EVf](h) = [B,f\(h rfree(^)). 

• When (f> x cj)’ is a {{-formula, 

[B, 0 x f'](h) = [B,f](h) • [B,<//](/i). 

• When f + cj)’ is a ({-formula, 

[b A+vm = [B,<j>](h) + [v,<t>'m- 

• When n is a {{-formula, 

[B, n](h) = n. 

We consider two ({-formulas f, (ft with free(</>) = free(^> / ) to be logically 
equivalent if for each structure B, it holds that [B,0] = [B, (j)']. A tt- 
sentence if represents or is a representation of a fo-formula f(V) if for each 
finite structure B, it holds that |</>(B)| = [B,^]^), where e is the empty 
assignment. For simplicity, when if is a {(-sentence, we will typically write 
[B, if] in place of [B, ip](e). We will use the term jjPP -representation to refer 
to a representation that is a jjPP-formula, and define j(EP -representation 
similarly. 

We make the basic observation that each fo-formula has a representation. 

Proposition 3.2 For each fo-formula f(V), the (j -sentence PVC(f,V) is 
a representation of <f(V). 

3.3 Examples 

Example 3.3 Define the formula 4>(xq, x\, X 2 , Vo, yi, yf) as fo a cj>\ a f 2 , 
where <f t = 3ziTi(xi, Xj+i, yi, zf) for each i e {0,1,2}; here, the quantity 
i + 1 appearing in X{ + \ is computed modulo 3. These formulas are over the 
vocabulary {To,Ti,T 2 } having three relation symbols, each of arity 4. 

Define ifi = P{yi}C(fi,{xo,xi,X 2 1 yi}) for each i e {0,1,2}. Observe 
that when B is a structure and h : {xo, xi, X 2 } —>• B is a map, [B, ipi\(h) gives 
the number of extensions h' : {xo,xi,X 2 , Vi] —» B of h satisfying <f>i on B. 

We have free(V’o) = free(V’i) = free(^ 2 ) = {xo, xi, X 2 } and closed(V’i) = {*/?:}• 

Now consider if = (if 0 x iff) x if 2 . It can be verified that, for a structure 
B and a map h : {xo,xi,X 2 } —► B , [B ,if](h) gives the number of extensions 
h! : {xo, xi, X 2 , yoi Vii 2 / 2 } —► B of h satisfying f on B. It follows that the 
{(-sentence P{xo, xi, a^}^ is a representation of f. Note that this represen¬ 
tation of f has width equal to max(3, width(^o)) width(^i), width}^)) = 4. 

As a further remark, consider, as an example, the subformula C(fo, {xo, xi, X 2 , yo}) 
of ifo- It holds that X 2 f free(</>o), and so E{x 2 }C(fo, {xo, xi, yo}) is a jj- 
formula and is logically equivalent to C(f 0, {xq, xi, X2, yo })- □ 


13 



Example 3.4 Consider a prenex pp-formula 4> without quantifiers whose 
structure view has the form (A, A). Suppose that (T, {B t )) is a width k tree 
decomposition of the graph of (A, A); without loss of generality, we may 
assume that this tree decomposition is nice, and that the root node r of T 
has B r = 0. We explain how to give a representation of (j) having width 
^ t 1. 

Say that a node u of T is below a node t of T if t occurs on the unique 
path from u to the root of T (this is understood to hold in particular when 
u = t). When t is a node of T, define B^ t to be the union of B u over all 
nodes u that are below t. 

We show that, for each node t of T, there exists a ((-formula 0 such that: 

• freest) = B t , 

• closed(0) = B^ t \B t , and 

• for any structure D, the value [D ,0]0) is equal to the number of 
extensions h! : B^ —> D of h such that (D, h') satisfies each atom 
R(a ±,..., ak ) of 0 whose variables all fall into a bag B Ul with u below 
t. 

We give a ((-formula depending on the type of the node f; we use the notation 
from the definition of nice tree decomposition. 

• (introduce) 0 = (Et;0/) x C(ai,B t ) x ••• x C(a m ,B t ) 

where the cq are the atoms of whose variables fall into Bt. Note 
that the presence of the C^cq, B t ) ensures that [D, = 0 if (D, h) 

does not satisfy all of the cq. 

• (leaf) ifit is defined as in the previous case, except V’i' is taken to be 
C( T, 0), where T is the true value. 

• (forget) V>t = P MVv- 

• (join) ijj t = 'ipn x 0 2 ■ In this case, 01 x 0 2 is a ((-formula since 
free(0 1 ) = B tl = B t2 = free(0 2 ), and, by the definition of tree de¬ 
composition, B^t^Bt! = closed(0 1 ) and B<z t2 \B t2 = closed(0 2 ) are 
disjoint. 

The desired representation is 0. The claim on the width of 0 holds, as 
each jj-subformula 0 of 0 has free(0) = B w for a node w of T, and each 
fo-subformula of 0 is either T or an atom whose variables fall into a bag 
B t . □ 
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4 Main theorem statements 


The treewidth of a prenex pp-formula <p, denoted tw(^>), is defined as the 
treewidth of the graph of <p. The following notions are adapted from [CMal 
ICM16] . Let (A, S) be a prenex pp-formula with graph G. An 3-component 
of (A, S) is a graph of the form GfV 7 ] where there exists V c A that is a 
connected component of G[A\5] and V' is the union of V with all vertices 
in S having an edge to V. Define contract(A, S) to be the graph on vertices 
S obtained by starting from G*[5] and adding an edge between any two 
vertices that appear together in an 3-component of (A, 5)H A core of (A, S ) 
is a prenex pp-formula (c(A), S) where c is an A-endomorphism fixing each 
s e S that has minimum image size; by c(A), we mean the structure with 
universe c(A) and where R c = {c(t) \ t e i? A }; here, c(t) denotes the 
tuple obtained by applying c to each entry of t. It is known that any core of 
a prenex pp-formula (f> is logically equivalent to 4> [CM7 7]. We speak of the 
core of a pp-formula, as it is unique up to isomorphism; this follows from 
the basic theory of cores [HN92j . 

A class of prenex pp-formulas satisfies the contraction condition if the 
class containing each graph contract^), where ip is the core of a formula in <L, 
has bounded treewidth; d> satisfies the tractability condition if it satisfies the 
contraction condition and the cores of formulas in <f> have bounded treewidth. 
Previous work showed that, for bounded arity <h, count(<I>) is fixed-parameter 
tractable if $ satisfies the tractability condition; interreducible with the 
parameterized clique problem if 4> satisfies the contraction condition but 
not the tractability condition; and as hard as the parameterized counting 
clique problem otherwise (see |CMa] for a precise statement). 

We first study representations of pp-formulas, obtaining the following 
theorems. 

Theorem 4.1 Let T be a class of prenex pp-formulas. 

• The class <f> satisfies the tractability condition if and only if there exists 
k 5= 1 such that each formula in <f> has a jjPP -representation ft having 
width(0 / ) ^ k. 

• The class $ satisfies the contraction condition if and only if there exists 
k ^ 1 such that each formula in T has a jjPP -representation ft having 
fj-width(0 / ) ^ k. 

Theorem 4.2 There exists an algorithm that, given a prenex pp-formula <f, 
outputs a (JPP -representation ip of cj) of minimum width. 

Building on this understanding of pp-formulas, we are then able to 
achieve general versions of these theorems for ep-formulas. Previous work 

2 Note that in previous articles ICMal [CM16I . this graph was defined in terms of the 
core of the given formula. 
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showed that for any class 4> of ep-formulas, there exists a class 4> + of 
prenex pp-formulas such that the problems count(<h) and count( < h + ) are 
interreducible (see [CMlfij for a precise statement), and hence (for example) 
whether or not the tractability condition holds on 4> + determines whether 
or not count(d*) is fixed-parameter tractable. 

Theorem 4.3 Let 4> be a class of ep-formulas. 

• The class 4> + satisfies the tractability condition if and only if there 
exists k ^ 1 such that each formula in 4> has a j)EP -representation (ft 
having width {(ft) ^ k. 

• The class 4> + satisfies the contraction condition if and only if there 
exists k ^ 1 such that each formula in 4* has a j)EP -representation (ft 
having jj-width((//) k. 

Theorem 4.4 There exists an algorithm that, given an ep-formula <f>, out¬ 
puts a BEP -representation if of of minimum width. 

5 Independence theorem 

In this section, we establish a key tool for reasoning about BEP-formulas. 

Define two fo-formulas 4>(V ), cft(V') to be counting equivalent if, for each 
structure B. it holds that |</>(B)| = |0'(B)|; note that a decidable character¬ 
ization of counting equivalence on pp-formulas is known (see Theorem ID.21) . 

In the scope of this article, define a linear combination to be an expres¬ 
sion of the form YjILi c i|^i(ti)|, where each c* is a non-zero rational number 
and the (fiiVi) are pp-formulas that are pairwise not counting equivalent. 
Each linear combination i naturally induces a mapping £(■) from finite struc¬ 
tures to Q. The following theorem will be key for our understanding of 
equivalence of jjEP-formulas. 

Theorem 5.1 (Independence theorem) For any non-empty linear combina¬ 
tion i, there exists a finite structure D such that £(D) ^ 0. 

We devote the rest of this section to proving this theorem. By multiply¬ 
ing all values c* by a multiple of their denominators, we can and will assume 
that each value Ci is an integer. In order to establish this theorem, we will 
make use of an action of a univariate polynomial on a structure; this action 
has the following commutativity property. 

Lemma 5.2 (Lovasz lLov6 ftj) For each connected liberal pp-formula 4>, each 
univariate polynomial p with positive integer coefficients and each finite 
structure B, it holds that |0(p(B))| = p(10(B)|). 
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The following lemma shows that certain pp-formulas can be controlled 
independently of each other, made precise as follows. 

Lemma 5.3 Let (fi(Si),... ,(j) n (S n ) be connected liberal pp-formulas (over 
signature t) that are pairwise not counting equivalent. Then for every 
m ^ 2, there exist structures (B( aii _ jan ) | (ai,..., a n ) e [m ] n ) and injective 
functions fi,..., f n : \m\ —* N such that for each (ai,..., a n ) e \m\ n and 
each i e [n\, it holds that |0j(B( ai! ... )0n ))| = fi(cn)- Moreover, when A is any 
structure on which |</>j(A)| > 0 for each i e \n\, each structure R( ai ,...,a n ) can 
be chosen in the form A x E, where E is a structure such that |0(E)| > 0 
for all pp-formulas over r. 

We now introduce a highly useful notion, that of component polynomial. 
Fix a set V of liberal variables. Denote by £ the set of counting equivalence 
classes of liberal connected pp-formulas (with liberal variables from V). A 
component polynomial q is a multivariate polynomial with integer coefficients 
over variables {X e \ e e £}. For any finite structure B, we define the value 
of q evaluated on B, denoted by g[B], as the integer value obtained by 
evaluating q when each X e is given the value |0 e (B) |, for a formula 4> e e e. 
The following is our main theorem on component polynomials. 

Theorem 5.4 When q is a non-zero component polynomial, there exists a 
finite structure B such that g[Bj / 0. Moreover, when </>i(Si),... ,<f n {S n ) 
are representatives of the equivalence classes e\,... ,e n e £ whose corre¬ 
sponding variables X ei appears in q, the structure B may be picked as a 
structure of the form provided by Lemma 15.31 

In order to establish this theorem, we will make use of the following 
known fact concerning multivariate polynomials; see for example |Alo991 
Lemma 2.1] for a proof. 

Proposition 5.5 Let p(x i,..., x n ) be a multivariate polynomial in n vari¬ 
ables over a field F. For each i e \n\, let di denote the degree of p as a poly¬ 
nomial in Xi, and suppose that Ti c F is a set of size di + 1 or greater. Then, 
if p is not the zero polynomial, there exists a point (ti,..., t n ) e T\ x • ■ ■ x T n 
such that p(ti ,..., t n ) 7^0. 

Proof. (Theorem 15.41) Let (j>i(Si ),..., (f n {S n ) be as described in the theo¬ 
rem statement. Let m J 5 2 be a value that exceeds the degree of each of the 
variables X ei ,... ,X en in q, and apply Lemma HOI to obtain structures 

( B (a 1 ,...,a„) I (oi,...,On) e N") 

and the corresponding functions /i,...,/ n : \m\ —* N. Evaluating q on 
these structures amounts to evaluating q when the variables (X ei ,..., X en ) 
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are given values in f\ ([m]) x x f n ([m]). By Proposition 15.51 q must 
evaluate to a non-zero value on one of these structures. □ 

Proof. fTheorem l5.ll) Denote l by 22=1 c i\fa(Yi)\ and let (Ai, Pi),..., (A m , V m ) 
be the pairs corresponding to the formulas 4>i(Vi ),..., (j)m(V m ). By rearrang¬ 
ing the indices, we may assume for the sake of notation that Ai,..., A& are 
homomorphically equivalent structures (where k £ [m]) and that for no i 
with k < i < m does A* have a homomorphism to Aj. 

For any structure B, it holds that one of the values |0i(B)|,... , [<^t(B)| 
is non-zero if and only if all of them are. Now, for each i, define fa(Vi) 
from fa (Vi) by removing non-liberal components, that is, by removing each 
atom whose variables fall into a non-liberal component. Note that for ev¬ 
ery i £ [fc] and for every structure B such that |</>j(B)| > 0, we have that 
|</>i(B)| = |0j(B)l. Since the fa are pairwise not counting equivalent, it 
follows that the fa are pairwise not counting equivalent. For each formula 
fa(Vi), by considering its liberal connected components, we may define r* to 
be a component polynomial which is a product of variables from {X e \ e £ £} 
such that |0i(B)| = j-j[[B] for all finite structures B. The products ri,..., 
are pairwise distinct, so r = ciri + - • -+ 0 ^^ is a non-zero component polyno¬ 
mial. By applying Lemma [5 . 31 with A = Ai and then invoking Theorem [53 
we obtain a finite structure D of the form A x C such that r[D]] / 0. Since 
there is by assumption a homomorphism from A,; to A and there is by The¬ 
orem 5.4 a homomorphism from Aj to C, we have |^>j(D)| > 0 for every 
i £ [k\. Consequently, |</>j(D)| = 0j(B) by the observation from above. 
Since no structure A,; with k < i ^ m maps homomorphically to A, we 
have |0fc + i(D)| = • • • = |^ m (D)| = 0 and hence £( D) = r[Dj p 0. □ 

6 Quantifier aware width 

In this section, we introduce a new width measure of pp-formulas which 
we call quantifier aware width and show that it is related to the width of 
JjPP- formulas. 

We here assume all tree decompositions of pp-formulas to be nice. So 
let (T, ( Bt)teT) be a nice tree decomposition of a pp-formula fi. For every 
variable x of cj) let top(x) be the vertex t of T that is highest in T such 
that x £ B t . We call a tree decomposition of </> quantifier aware if for every 
3-component C of cj) and for all x £ V(C)\ free(^) and all y £ V ( C) n free(0), 
we have that top(y) is on the path from top(x) to the root of T. We call the 
quantifier aware width of a pp-formula (j>, denoted by qaw(^>), the minimal 
treewidth of a quantifier aware tree decomposition of cj) plus 1. 

We call a jJPP-formula cj) basic if it does not contain + nor subformulas 
of the form n. where n £ Z. 

In a sense made precise in Appendix [Bj we will show the following three 
properties of quantifier aware width. 
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• For every class of pp-formulas, having bounded quantifier aware width 
is equivalent to the tractability condition. 

• There is an FPT-algorithm parameterized by the quantifier aware 
width that given a pp-formula computes an equivalent pp-formula of 
minimal width. 

• Every pp-formula f has a representation by a basic $PP-formula <p' and 
for every basic jjPP-formula &' there is a pp-formula <f> “representing” 
it. Moreover, these translations can be performed so that qaw(^>) = 
width (<^/). 

7 Proofs of main theorems 

The following lemma shows that if we are only interested in ^PP-representations 
of pp-formulas, we may restrict ourselves to basic jjPP-formulas. 

Lemma 7.1 Let f be a pp-formula and let <f>' be a $PP -representation of <f>. 
Then there is a basic (JPP -sentence f" that is also a tfPP -representation off 
such that width(0") ^ width(^> / ) and jj-width(0 // ) ^ (j-width (ft). 

Proof .(sketch) Using a normalization result for (JPP-formulas (see Appendix ICl) 
and the results of Section [6l we can find numbers c^eZ and pp-formulas f" 
such that for all structures B 

£' 

[B,^] = 2c{K(B)|, (1) 

i—1 

where the (jf[ are pairwise not counting equivalent and have qaw(0") ^ 
width(0 / ). Note that dTJ) is a linear combination. 

Now note that [B,0'] = |0(B) |, which gives another linear combination. 

If follows with Theorem I5T1 that dT]) consists only of one summand with 
coefficient 1. Now applying the results of Section [6] yields the result. □ 

Proof, [of Theorem 14. 1 j fsketch! Observing that by Lemma 17. II we may our¬ 
selves restrict to basic [jPP-sentences, the statements can be shown with the 
results of Section [6] in a rather straightforward fashion. □ 

Proof, [of Theorem 14.2] (sketch) By Lemma 17.11 we may assume that the 
desired j)PP-representation is basic. But then the results of Section [6] directly 
yield the theorem. □ 

Proof, [of Theorem 14. 3| (sketch) Let first T* satisfy the tractability condi¬ 
tion. Then there is a constant k such that for all cores if of pp-formulas in 
$* we have tw(if) ^ k and tw(contract(y>)) + 1 < k. With the results of 
Section [6] it follows that qaw(^>) ^ 2k for such cores if. Using techniques 
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from [CM16] , we can express for each </> e $ and each structure B the |</>(B)| 
as a polynomial in unknowns of the form |^/(B)| where crucially the poly¬ 
nomial only depends on <j> and not on B. We can then substitute every term 
!?//(•)I by a tjPP-formula of width at most 2k with the results of Section [6l 
This yields a (JPP-formula of width at most 2k. 

For the other direction, assume that there is a constant k such that each 
ep-formula in <f> has a jjEP-representation ft with width (</>) ^ k. Let ^ be a 
pp-formula from d>*. We will show that is equivalent to a pp-formula of 
quantifier aware width at most k which completes the proof with Section [HI 

We first choose cj) e <F such that if appears in cf*. Let <p' be the j}EP- 
representation of <f> of width at most k. We first construct a linear combi¬ 
nation as in ((TJ) . Note that all summands of have quantfier aware 
width at most k as in the proof of Lemma 17.11 

We then construct a second representation of |</>(B)| as a linear combi¬ 
nation £2 (•)• A s in the first direction, for every structure we can express 
10(B)| as a polynomial in unknowns of the form |0(B)|. Arithmetic simplifi¬ 
cations and elimination of counting equivalent terms gives the second linear 
combination computing |0(B)|. We then argue that a term of the form 
c • |V’(B)| must appear in f^O)- 

We have that i\ and ti compute |</>(B)| and are hence equal. With 
Theorem 15.11 it follows that i\ contains a term that is counting equivalent 
to if. Since all summands of l\(.) have quantifier aware width at most k , 
the claim follows. □ 

Proof, [of Theorem 14.4] (sketch) Using Section [6] and normalization results 
( AppendixQ, cf can be turned into a linear combination as in (H|). Moreover, 
starting with any such representation yields the same linear combination up 
to counting equivalence of the summands by Theorem l5.ll Now turning this 
linear combination into a fjPP-formula yields a [(PP-representation and min¬ 
imizing the width of the summands with Theorem 14.21 gives a representation 
of optimal width. □ 
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A Missing material from Section [5] 

A.l Explanation of Lemma 15.21 

Let p be a univariate polynomial with positive integer coefficients and vari¬ 
able X. Fix a representation of p as a term with 1 and X as the inputs 
and where addition and multiplication are the operations. For a vocabulary 
r, let I T denote the r-structure with universe {a} and where each relation 
symbol Ret has R 1 = {(a,... , a)}. For each structure B over vocabulary 
r, we define p(B) as the r-structure obtained by evaluating the representa¬ 
tion of p by interpreting 1 as I r , X as B, addition as the disjoint union u 
of two structures, and multiplication as the product of two structures. 

Here, by the disjoint union AwB of two structures A, B, we mean the 
structure obtained as follows: rename the elements of A, B so that their 
universes A, B are disjoint, and then take the structure with universe Hu B 
where i? A ® B = R A u i? B for each relation symbol R. By the product A x B 
of two structures A, B, we mean the structure with universe Ax B and where 
rAxB = 5 ( a k ,b k )) | (ai,... ,a k ) e R A , (b i, ...,b k )e R B }. 

Lemma f5. 2 1 can be proved by a straightforward induction on the structure 
of p, using the observations that [<^(I r )| = 1, |</>(D u D')| = |0(D)| + |</>(D')|, 
and |0(D x D')| = |^(D)| - 

A.2 Proof of Lemma 15.31 

To establish this lemma, we make use of the following theorem established 
in prior work. 

Theorem A.l (Consequence of Theorem 5.f and Lemma 5.12 from [CM 16 ]) 
Let (f>i (Si ),..., <f> n (S n ) be liberal pp-formulas over the same vocabulary t, 
where each component of each (pi is liberal. Then there is a structure C 
(over t) such that for all pp-formulas <f> (over r) it holds that |0(C)| > 0; 
and for all i,j e [n] with i H j, it holds that |</>j(C)| / |</>j(C)|. 

Proof. fLemma 15.31) As each (pi(Si) is connected and liberal, Theorem lA.il 
provides that there exists a structure C' such that the values \cpi(C')\ are 
pairwise different. By taking a sufficiently large power P of C', we may 
obtain that for the structure C = C' p x A, the values q = \<f>i(C)\ are 
pairwise different. For each (ai,...,a n ) e [m] n , define pt ai ,...,a n ) to be a 
univariate polynomial over the rationals that evaluates to 0 at 0, and to a* 
at Ci (for each i e [n]). Define D to be the absolute value of the product of all 
denominators of coefficients in the defined polynomials. Set p\ ax a ) = ^ ‘ 
P(ai,...,a „); each such polynomial has integer coefficients. Next, set p7 a ^ 
to be the restriction of p'^ ai a ^ to summands with negative coefficients. 

Define P'U,...,a n ) to be P( ai ,...,a n ) + 2 I i (a 1 ,...,a n )e[rn]n{-P( ai ,... l a n) )- N ° W > for 
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each (ai,... ,a n ) e [m] n , define the structure B( 0l t ... t a n ) as P"( ai ari )(C); for 

each * e [n\, we have = I<Mp" 01 g„.)( C ))l = P'l ai ,...,a n )^ 

the second equality here holds by the Lemma 15.21 From these equalities 
and the definitions of P( ai ,...,a„) and P" ai a p it is straightforward to verify 
that the defined structures have the desired property. Our claim concerning 
each S( ai> Qn ) having the form A x E holds, as p" ai a ^(0) = 0 (for each 

(ai,...,a n ) e [m|"), implying that the structures B( 0l provided can 

be obtained in the form C x -, which has the form Ax-. □ 


B The details for Section |6] 

Remark B.l The quantifier aware width can be arbitrarily higher than 
their treewidth. To see this consider the formula <f> = 3z Aie[n] E{ x i, z). 
The primal graph of is a star, so it has treewidth 1. We claim that the 
quantifier aware width of is n + 1. To see this, observe first that the 
free variables x \,..., x n must appear above top(z) in a bag of any quanti¬ 
fier aware tree decomposition. But since XiZ is an edge in the primal graph 
for every i, the variable Xi must also appear in a common bag with z and 
consequently also in top(^). Thus top(z) must contain n + 1 variables. 

We first show that qaw(^>) is, in a sense, characterized by tw(^) and 
tw(contract(</>)), for every pp-formula cf. Consequently, quantifier aware 
treewidth will allow us to characterize tractable classes of pp-formulas for 
counting. 

Lemma B.2 For every pp-formula <f> we have 

max{tw(0), tw(contract(^>))} + 1 
^qaw(</>) ^ tw(</>) + tw(contract(<^>)) + 1. 

Proof. The main idea of the proof is that forcing the free variables of a free 
component to appear above the quantified variables has a very similar effect 
as connecting them to a clique in the construction of the contract((/>). So 
from a quantifier aware decomposition we get a decomposition of contract^) 
by restricting to the tree decomposition to the free variables. For the other 
direction, we can add the quantified variables to a tree decomposition of 
contract^) in a straightforward way. We now give the details. 

For the first inequality, observe first that any quantifier aware tree de¬ 
composition of f is a tree decomposition of <f, so tw(^>) + 1 < qaw(^>) is obvi¬ 
ous. To prove the inequality tw(contract(</>)) + l ^ qaw(^), let (T, (. B t )teT) be 
a quantifier aware tree decomposition of cf. Introduce for every 3-component 
of 4> a new vertex vc- Then substitute in every Bt every non-free variable x 
of cj) by vc where C is the 3-component that contains x. Call the result B’ t . 
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We claim that (T, ( B' t ) te T ) is a tree decomposition of contract(</>). To see 
this, note that by the same argument as in Remark IB. II we have for every 
3-component C a bag that contains ( V(C ) n free(0)) u {uc}- 

For the second inequality, first compute a tree decomposition (T, ( B t )teT ) 
of the contraction contract(</>). Note that for every 3-component C of <p 
there is a bag B t * that contains V{C ) n free(</>), because this variable set 
forms a clique in contract^) and it is well known that for every clique in 
a graph every tree decomposition must contain a bag that contains this 
clique completely. Now compute a tree decomposition (T\ () for 
G[R(C)\free(</>)], where G is the primal graph of <fi. Then construct for 
every t e T' a new bag B" := B' t u ( V{C ) n free(^)). Finally, connect T 
to T' by connecting an arbitrary vertex of T' to t*. Doing this for every 
3-component yields a quantifier aware tree decomposition of < j>. Moreover, 
the width of the decomposition is at most tw(contract(c/>)) + tw(^) + 1 which 
completes the proof. □ 

We now show how to compute quantifier aware treewidth which will 
allow us to compute tjPP-formulas of optimal width. 

Lemma B.3 Computing quantifier aware tree decompositions of minimal 
width of pp-formulas is fixed-parameter tractable, parameterized by the quan¬ 
tifier aware width. 

Proof. The idea of the proof is to add edges to the graph of <f> in such a 
way that the treewidth of the resulting graph is exactly qaw(ij!>); then, we 
apply standard algorithms for computing treewidth. To this end, let 4> be 
a pp-formula with primal graph G and S := free(</>). For each 3-component 
C of (j) choose a vertex xc £ R(C')\free(</>) and connect it to all vertices 
y e V(C ) n free(0); moreover, connect the vertices in V(C) n free(^) by 
a clique. Call the resulting graph G'. We will show that the minimum of 
tw(G') + 1 over the choices of the xc is qaw(^). 

We first show that for every choice of the xc we have tw(G / ) + l ^ qaw(0). 
To see this, fix a tree decomposition (T, ( B t )teT ) of G'. Since R(C')\free(i?i)) 
is connected in G’ , the bags containing R(C)\free((?i>) are contained in a 
subtree T' of T. Moreover, because {x c } u ( V(C ) n free(^)) is a clique in 
G', we know that {x c } u {V{C) n free(</>)) ^ B t * for some t* in T'. Since 
none of the vertices in V (C)\free((/>) have any neighbors outside of V(C), we 
may assume that t* is the root of T'. Then it is easy to see that (T, (Bt)teT) 
can be turned into a quantifier aware tree decomposition: We only have to 
potentially add a new bag B t ** := V(C) n free(V/i) and a vertex t** in the 
decomposition. Then connect t** to t* and its parent and delete the edge 
between t* and its parent. 

For the other direction, let (T, ( B t )teT ) be a quantifier aware tree de¬ 
composition of (j). We will show that it is also a tree decomposition of G 1 
for a choice of the xc- First note that by the same argument as before, the 
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vertices of V (C) n free(</>) are contained in a subtree T' of T. Let x' c be 
the only variable of L A (C')\free(^)) that is contained in B r where B r is the 
root of T' . Note that by the same argument as in Remark IB. 11 we know 
that B r contains V (C) n free(</>). Thus B r covers all edges introduced in the 
construction of G' when choosing xc = x' c . Thus (T, ( B t )teT ) is indeed a 
tree decomposition of G' for the right choice of the xc- 

Since computing tree decompositions is fixed parameter tractable param¬ 
eterized by the treewidth (see e.g. [FGflbj l. the only problem left to solve 
is the right choice of the xc- But since the quantified variables of the dif¬ 
ferent 3-components are independent, we can do this choice independently 
for every 3-conrponent C as follows: Construct G" by choosing a vertex 
xc e V(C) n free((^) and proceed as in the construction of G' . Now for all 
other 3-components C' connect V{C') n free (< 5 !) to a clique and delete all 
variables in V(C r )\free((p) . Clearly, trying all potential choices of xc lets us 
optimize the choice for C. Doing this for all 3-conrponents gives the desired 
choice and thus the optimal quantifier aware tree decomposition. □ 

We call a ((PP-formula (f) basic if it does not contain + nor subformulas of 
the form n, where n e Z. We now show that basic fjPP-formulas correspond 
very closely to pp-formulas. 

Lemma B.4 a) For every basic (jPP -sentence ft there exists a pp-formula 
(j) that ft represents with 


width (0') ^ qaw((/>), 


and 

t)-width(0 / ) ^ tw(contract(0)) + 1. 

b) For every pp-formula cf there exists a basic JjPP -sentence ft that repre¬ 
sents (j) with\N\dth(ft) ^ qaw {ft), and ((-width (ft) ^ tw(contract(i^))-|-l. 

Proof. The proof relies on the observation that the condition on top(x) 
in quantifier aware tree decompositions corresponds closely to the fact that 
free variables can only be closed by a P-quantifier in jjPP-formulas after 
the contained pp-formula has been casted by a C-quantifier. With this in 
mind, a (jPP-formula is transformed into a pp-formula by making use of 
and inducting on the flPP-formula’s structure (viewed as a tree). The other 
direction is similar. 

a) Let ft be a basic JjPP-sentence. By potentially renaming quantified 
variables, make sure that every variable in ft is either free or quantified 
exactly once. We construct f by deleting all C -, E- and P-quantifiers and 
substituting all x by a. Obviously, the result is a pp-formula. Note that 
for every subformula ft of ft, we have an associated subformula if of cj>. 
For every subformula if of we define lib(^) to be the variables of ft that 
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are not quantified in if We claim that for all (j-subformulas ip' and every 
assignment h to free(ip’) that 

[B, ip'](h) = | {h! : lib(g/>) —» B \ h' extends h, (B ,h') \= ip}\. (2) 

We show @ by induction on the structure of basic (jPP-formulas. If 
ip' = Cip" for a pp-formula ip" , then we actually have ip = ip". Moreover, h 
assigns to values to all liberal variables of ip, so both sides of © are 1 if and 
only if h satisfies ip. If ip' = EVip" or ip' = PVip" , we get ([2]) directly from 
the semantics of jj-formulas and induction. Finally, if ip’ = ip' x x ip' 2 , we have 
that lib( , 0i) n li b(ip 2 ) Q lib(^). Therefore, lib(0i) and lib(-02) overlap only 
at variables where a mapping h! : lib(?/>) —> B is defined, and <[2|) follows 
easily. 

It remains to show the inequalities of the width measures. To this end, 
consider the syntax tree T of <p' . For every node t of T, define Bt := free(</>£) 
where <p' t is the subformula of (p r that has t as its root. Note that (T, ( B t ) t eT ) 
satisfies the connectivity condition and is thus a tree decomposition of cp 
of width width(0 / ) — 1. Also, (T, (Bt)teT) is quantifier aware because in <p' 
existential quantification is only allowed in the pp-part in which all free vari¬ 
ables of (p are still free. This shows width(</> / ) ^ qaw(^). Now observe that 
by deleting all bags that contain quantified variables we end up with a tree 
decomposition for contract^). This shows jj-width(^) ^ tw(contract(</>)) + 1. 

b) Let now <p> be a pp-formula and let (T, ( B t )teT ) be a nice quantifier 
aware tree decomposition of cp of width k — 1. For every 3-component C 
of (p, the vertices F(C)\free((/>) all lie in the bags of a subtree Tq of T. 
Moreover, we may w.l.o.g. assume that the bags in Tq do not contain any 
vertices not in V(C). Finally, we have that the bag B rc where r c is the root 
of Tq contains V(C) nfree(^), because (T, (Bt)ter) is quantifier aware. The 
results of [DKVD2] assure that there is a pp-formula (pc of width k that is 
logically equivalent to the pp-formula that we get by restricting cp to the 
atoms that have all of their variables in V(C). 

We now construct for every t £ T such that Bt does not contain any 
quantified variables of <p a basic f)PP-sentence <p' t . So let 1 be a node of t 
with the desired properties. Let atom(t) be the atoms of cp containing only 
variables in B t and set (pt := riv>eatom(t) If ^ bas no children, set cpt := (pt■ 

If t has a child t' such that Bf\ficee((p) A 0, then t has only that one 
child because (T, ( B t )teT ) is nice. Let C be the unique 3-conrponent of the 
variable in B t i\hee((p). We set (pt := (pt x C(pc- 

If t has a child t' such that B t '\free((p) = 0 and a variable x is forgotten 
when going from t' to t, then set (pt '■= Px<pf. 

If t has a child t' and a variable x is introduced when going from t! to t, 
then set (pt := (pt x Ex(p t '. 

If t has two children t\ and t- 2 , then note that Bf 1 \free(^>) = Bt 2 \free((p ) = 
0. Moreover, iree((p tl ) = free(^>t 2 ). We define (pt := x (pt 2 . 
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Set cp' := Piree(cp r )(p r where r is the root of cp. 

An easy induction along the construction of cp similar to that in a) shows 
that cp 1 does indeed compute the correct value for every structure B. More¬ 
over, the width of cp' is at most k which completes the proof. 

If we do not have a bound on qaw(0) but only on tw(contract(0)), the 
same construction as above yields the the bound tw(contract(0)) + 1 ^ 
jj-width((//). The only difference is that we do not have to bound the width 
of the pp-formulas with |DKV02j . □ 


C Normalizing jJEP-formulas 

We call a jjPP-formula constant if it is only constructed from constants in 
Z, x, and P- and ^-quantifiers. We call a j)PP-formula flat if it is of the 
form V’i x 4>i where where the ip \,..., ip n are constant and (pi,..., (pg 

are basic (jPP-formulas. 

The result of this section is the following normalization. 

Lemma C.l There exists an algorithm that computes, for a given (jEP- 
formula cp, a logically equivalent flat jjPP -formula <p' such that width(<?!/) ^ 
width(0). 

We prove a sequence of lemmas to aid us. 

Lemma C.2 There exists an algorithm that computes, for a given ftEP- 
formula (p of the form C(ip,L), a logically equivalent (jPP -formula cp' such 
that width((//) ^ width(^). 

Proof. In a first step, we transform ip into a logically equivalent ep-formula 
V ’ d = VL ;1 ip, where the ipi are pp-formulas; this can be done without in¬ 
creasing width jChe!4j . Then we claim that (p is logically equivalent to 
cp' = 2jc[ s ] 1)I J I +1 ) Hisj C(V’i) L). First note that this is a well- 

formed {(-formula, because for all additions and multiplications the free vari¬ 
ables of all operands are L. It remains to show that cp' is logically equivalent 
to <p. So fix B and h : L —> B. 

If h does not satisfy ip, then [B,</>](h) = 0. Since h does not satisfy any 
ipi, it is easy to see that [B,</>](/i) = 0 as well. 

Assume h satisfies ip-, say that ipi,..., ip£ are the disjuncts that it satisfies. 
By definition [B,0](h) = 1. Also, 

[B ,<P'](h)= 2 (- l)\ J \ +1 Yl[B,C(iPi,L)](h)= (-1) |J|+1 ; 

ieJ 

this latter quantity is equal to the sum 
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It is readily seen that the width of cf>' is not bigger than that of <p. □ 

We call an {(-formula 4 --free if it does not contain + . 

Lemma C.3 There exists an algorithm that computes, for a given flEP- 
formula cp, a logically equivalent (jPP -formula cp 1 of the form 2i=i 4>i where 
the 4>i are +-free such that width(^Z) ^ width(^). 

Proof, (idea) The proof is by straightforward induction on the structure of 
<p, pushing all occurences of + up in the formula; for instance, one proves 
that PV(ip i + if 2 ) is logically equivalent to PVipi + PVip 2 - The base case 
cp = C(ip, L ) is Lemma 1C.21 □ 

The proof of the following lemma is by a straightforward induction. 

Lemma C.4 There exists an algorithm that computes, for a given constant 
JjPP -formula <p, a logically equivalent (JPP -formula <p' = EViPV 2 n with n e Z 
such that width(0 / ) ^ width(0). 

Lemma C.5 There exists an algorithm that computes, for a given +-free 
JjPP -formula (p, a logically equivalent (JPP -formula <p> = 'tp\ x ip 2 where tpi is 
constant and ip 2 is basic such that width (<//) ^ width (cp). 

Proof, (sketch) The proof is again straightforward induction in the style 
of Lemma 1C.31 We consider only the the case of P-quantifiers which is the 
only case that is not completely clear from the definition. 

So let <p = PVcp 1 where cp' = cp 1 x cp 2 such that <p\ is constant and (p 2 
is basic. Note that by Lemma 1C.41 we may assume that cp\ = PViEX^n for 
some n e Z. We claim that cp is logically equivalent to <p\ x PV cp 2 where 
(pi = PViEiyffV). To see this, consider a structure B and an assignment to 
cf. Then [B ,PVcP'}{h) = £ V [B,& x foW) = 2,,([B ,&]&) • [B, (p 2 ](h')) 
where the h' are as in the definition. Now choose an arbitrary assignment h" 
to 1 cp', then [B,^>i](/i") = [B, <pi\(h') = [B, <p'i\(h) for all h!. Consequently, 
[B ,PV<P'](h) = [B,^]( h»)-Z h ,[B,cP2\(h') = [B A' 1 ](h)-Z h ’[B,cP2W) = 
[B ,cP[](h) • [B ,PV<h](h) = [B, (p[ x PVcp 2 ](h). □ 

Proof. [of Lemma IC.lj First use Lemma lC.3l to turn cp into a sum of +-free 
(jPP-formulas. Then apply Lemma 1C.51 to each of the summands. □ 


D Full proofs of main theorems 

Remember that we call a (JPP-formula cp basic if it does not contain + nor 
subformulas of the form n, where ne Z. The following lemma shows that if 
we are only interested in jJPP-representations of pp-formulas, we may restrict 
ourselves to basic JJPP-fornrulas. 
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Lemma D.l Let cf be a pp-formula and let (ft be a ftPP -representation of 
cf. Then there is a basic j)PP -sentence eft' that is also a jjPP -representation 
of f such that width (ft 1 ) ^ width (eft) and )j-width((/> // ) ^ fj-width (<//). 

Proof. With Leiriina fC.il we may assume that (ft is flat, so let (ft = Xli=i ft'i x 
(fl where if[ is constant and <f\ is basic. Then using Lenrnra IB. 41 (a) we get 
that for every structure B, [B, (ft] = Xli=i Cj|<MB)|, where c* £ Z and (pi is a 
pp-formula. Now combine the sunrnrands of counting equivalent pp-formulas 
to get a linear combination with 

i' 

[B,4/] = 2c'|0"(B)|, (3) 

1=1 

where the cf" are pairwise not counting equivalent pp-fornrulas and c\ e 
Z\{0}. Note that for all f>" we have qaw(^>") ^ width(<//) and tw(contract(<^>")) ^ 
fj-width (0'). 

Since <f>' is a (jPP-representation of (p, we have [B,<//] = |^>(B)| for all 
structures B. By Theorem 15.11 it follows that the linear combination in 
([3]) consists only of one sunrnrand with coefficient 1. Let |-0(B)| be that 
summand. We have |-0(B)| = |^»(B)|. Now we apply Lenrnra IB. 41 (b) on 
input if to construct (p with the desired properties. □ 

Airother ingredient that we will use in the proofs of the main theorems 
is a syntactic characterization of counting equivalence. To this end, we say 
that two pp-fornrulas (A, V ), (A', V') over the same signature are renaming 
equivalent if there exist surjections h\ : V —* V' and /12 : V' —> V that can 
be extended to homonrorphisnrs h : A —* A.' and h' : A' —* A, respectively. 

A crucial result of [CM16] is that renaming equivalence is a syntactic 
characterization of counting equivalence. 

Theorem D.2 ([ CM 16 1 Two pp-formulas (pi(V), (p 2 (V r ) are counting equiv¬ 
alent if and only if they are renaming equivalent. 

We will make use of the following easy corollary. 

Corollary D.3 Let <p(V) and <ft(V') be counting equivalent pp-formulas. 
Then there is a pp-formula if that is logically equivalent to (f such that 
qaw('i/>) ^ qaw(<//) 

Proof. By Theorem ID. 21 we have that <f> and (ft are renaming equivalent. 
Let h\ and (12 be the surjections that witness renaming equivalence. Note 
that, since V and V' are finite, both hi and /12 are bijections. Asume 
w.l.o.g. that <f and (ft are on disjoint variable sets. We then construct if 
from cf by renaming each variable v £ V' to h'~ l (y). Obviously, eft and if 
are renaming equivalent and this witnessed by the functions hf 1 and h^. 
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But then cp and ip are also counting equivalent and this is witnessed by the 
identity mapping. By the work of Chandra and Merlin [CM7 7] discussed in 
the preliminaries it follows that cp and ip are logically equivalent. The bound 
on the quantifier aware width is obvious. □ 

Proof, [of Theorem 14.1 j We start with the second statement. Let first $ 
satisfy the contraction condition. Then there is a constant k such that for 
all cores cp of pp-formulas in <J> we have tw(contract(0)) + 1 < fc. But then 
Lemma IB. 41 yields basic (JPP-representations cp' with jj-width(<//) ^ k. 

Now assume there is a constant k such that every formula cp in has 
a (JPP-representation <p' such that jj-width (<//) ^ k. By Lemma ID.II we may 
assume that cp' is basic. Then by Lemma IB.4I we find a pp-formula <f>" 
that is counting equivalent to cp such that tw(contract(0")) + 1 ^ k. Using 
Corollary ID. 31 it follows that (p is logically equivalent to a formula ip with 
tw(contract((/>)) + 1 ^ k. Consequently, $ satisfies the contraction condition. 

For the first statement, let first satisfy the tractability condition. Then 
there is a constant k such that for all cores (p of pp-formulas in we have 
tw(</>) ^ k and tw(contract(</>)) + 1 ^ k. It follows that qaw (<p) ^ 2k by 
Lemma IB. 21 Then Lemma IB.4I yields a basic $PP-representation cp' with 
width(<//) ^ 2k. 

Now assume there is a constant k such that every formula cp in <h has 
a jjPP-representation cp' such that width((//) ^ k. We may again assume 
that cp' is basic. Then Lemma IB.4I gives a pp-formula <p" that is counting 
equivalent to (p such that qaw (cp) k. Using Corollary ID. 31 it follows that 
cp is logically equivalent to a formula ip with qaw (cp) ^ k. Now applying 
Lemma lB.21 shows that satisfies the tractability condition. □ 

Proof. [of Theorem 14. 2 j By Lemma ID. II we may assume that the desired 
(jPP-representation is basic. But then the correspondence of Lemma IB.41 in 
combination with Lemma lB.31 directly yields the result. □ 

To prove 14.11 we will use the following lemma that summarizes the main 
construction of the proof of Theorem 3.1 in [CM16] . 

Lemma D.4 For every set of ep-formulas there is a set T* of pp-formulas 
such that the following holds: For every cp e there are pp-formulas cpi,...,<pgs 
T*, pp-sentences ipi,... ,ipt e <I>* and non-zero integers ci,..., q such that 
the following holds for every structure B. 

• If B satisfies one of the sentences ipi, then 0B) = | 5 | free (<M. 

• Otherwise, we have 10(B) | = Ym.=i c* 10* (B) |. 

Moreover, the (pi are pairwise not counting equivalent and there is no homo¬ 
morphism from any sentence ipi to any ipj for i =£ j and no homomorphism 
to any (pj. 

Finally, for every cp' e 4>* we have that cp' appears as a <pi or ipi as above 
for a (p e T. 
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Proof, [of Theorem 14.3| Let first 4** satisfy the tractability condition. Then 
there is a constant k such that for all cores ip of pp-formulas in <J> we 
have tw(i p) ^ k and tw(contract(V>)) + 1 ^ k. With Lemma IB.21 it fol¬ 
lows that qaw(^) 2k for the cores ip of the pp-formulas in <f>. Then 
Lemma lD.41 provides pp-formulas ft ,...,(pi e 4>*, coefficients Ci,..., q, 
and pp-sentences ft,...,ipt e 4?* such that the following holds for every 
structure B: If B satisfies any sentence ft, then |</>(B)| = |£||fr ee wb oth¬ 
erwise, |</>(B)| = 2i =1 Cil&(B)|. Let |free(0)| = r. Then |0(B)| is equal to 

(nli(i - mmMhh cii^(B)i) + i^ra -n-=i(i - mm))- now let 

<f>i,..., <f >2 be the jj-formulas we get by applying Lemma IB. 41 on ft,..., ft 
and then multiplying respectively by the ([-formulas 

Eiiee(cj))ci ,..., Eiree((p)c£ 

that correspond to the coefficients c*. By Lemma IB.41 we have that the 
width of each ft is bounded above by 2k. 

Then set V’ to be Plib(</>) applied to the sum of 

t t 

(Mree(0)(l - C(\j ft, 0))) x (J] (pft 

i =1 i—1 


and 

t 

Efree(<p)C (\J i,0). 

1 = 1 

Clearly, ^ is a )(EP-representation of (p. Moreover, the width of ip is the 
maximal width of the ft and the ft i: which is bounded above by 2k. 

The first direction of the second item can be proved completely analo¬ 
gously. 

For the other direction, assume that there is a constant k such that 
each ep-formula in has a f|EP-representation ft with width(</>) ^ k. Let 
ip be a pp-formula from <f>*. We will show that ip is equivalent to a pp- 
formula of quantifier aware width at most k which completes the proof with 
Lemma IB. 41 

We first choose cp e <!> such that ip appears as a (pi or ipi as in Lemma IB. 41 
Let ft be the JjEP-representation of <p of width at most k. With Lemma lC.il 
we may assume that ft is fiat, i.e., it has the form X!i=i ft x ftn where the 
0) are constant and the <p\ are basic. As in the proof of Lemma EH this 
yields for every B 


\ftB)\ = j]ci\ft(B)l (4) 

i =1 

where the ft are pairwise not counting equivalent. Moreover, qaw(^j) ^ k. 
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We will now construct a second representation of |^>(B)| as a linear 
combination. To this end, let |lib(</>)| = r and let (p\ ,...,(p£, ipi ,..., ipt 
and c\,..., C£ be as in Lemma ID. 41 Then as before, for every structure 

i0(B)i - nl,(i - i*(B)i)(s',,c i i*(B)i) + ier(i * nli(i - ia(b)d). 

Now multiplying the righthand side out, we get that |0(B) | can be ex¬ 
pressed as a weighted sum of terms of the form |<^j(B)| • Y\j e j IV’j(B)! an d 
\B\ r Y\jeJ |V’j (B) ] . These terms are equivalent to 

Ui a/\^)(B)| 
jeJ 

and 

W a /\Vi)(B)|, 
jeJ 

since the ipj are sentences; here, ip r is the true formula with liberal variables 

lib 0). 

Now combine counting equivalent summands to get a linear combination 

MB)| = 2XK(B)|. (5) 

2 = 1 

We claim that the linear combination in © contains c • |-0(B) | or c • 
\ip r a , 0(B)|for some c ^ 0 as a summand. To see this, consider first the 
case that ip is not a sentence. Note that in this case c ■ ip( B) appears in 
the weighted sum from Lemma lD.41 and thus (for J = 0) also in the sum 
we get before combining counting equivalent summands. Moreover, ip is not 
counting equivalent to any other summand ip' in this sum. To see this, note 
first that by Lemma [D. 41 there is no homomorphism from any of the ipj to ip. 
Moreover, the (pi are pairwise not counting equivalent. Thus c\ip\ appears 
in ([5]) in this case. If ip is one of the sentences ipi , then ip r a ip is contained 
in the sum before combining counting equivalent summands (for J = {i}). 
Obviously, ip r a ip is not counting equivalent to any conjunction that contains 
a non-sentences ipi. Moreover, since there are no homomorphism between 
ip = ipi and ipj by Lemma ID. 41 ip r a ip is not counting equivalent to any 
other summand, so it must be contained in ©. 

It follows that and © give two linear combinations that are equal 
for every structure B. Using Theorem 15.11 it follows that (© contains a 
summand c- \ip\ that is counting equivalent to ip. Moreover, qaw('i/>) ^ k. It 
follows with Corollary ID. 31 that ip is logically equivalent to a formula with 
quantifier aware width at most k. □ 

Proof, [of Theorem I4.4J (sketch) By Lemma IC.ll we may assume that ip is 
flat, i.e., it has the for i p = X^ =1 ipi x cpi where the ipi are constant and the 
cpi are basic. Note that width(</>) = max, e £(width(^)j)). 
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Note that every flat jjEP-representation of <f> can be turned into a lin¬ 
ear combination as in Q. Moreover, starting with any such representation 
yields the same linear combination up to counting equivalence of the sum¬ 
mands by Theorem 15.11 Now turning this linear combination into a flat 
flPP-formula yields a jjPP-representation and minimizing the width of the 
summands with Theorem 14.21 gives a representation of optimal width. □ 
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